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^ : Abstract 

' We give a geometric description of variational principles in mechanics, with 

special attention to constrained systems. For the general case of nonholonomic 
constraints, a unified variational approach is given, and the equations of mo- 
tion of both vakonomic and nonholonomic frameworks are obtained. We study 
specifically the existence of infinitesimal variations in both cases. When the 
constraints are integrable, both formalisms are compared and it is proved that 
they coincide. As examples, we give geometric formulations of the equations 
of motion for the case of optimal control and for vakonomic and nonholonomic 
mechanics with constraints linear in the velocities. 
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1 Introduction 

To describe the motion of mechanical systems there is a variety of mathematical 
models which are based on different principles. Most of the physical models are 
obtained using an appropriate variational principle in a certain evolution space. But 
variational principles are not only important in physics, but also in many branches 
of engineering ||14| or economics |p, where one is interested in optimizing a given 



functional, possibly subject to some restrictions. In fact, constraints are ubiquitous 
in many mechanical systems and much more different situations. 

In this paper we are going to study Lagrangian systems, i.e., dynamical systems 
in which the equations of motion are obtained by finding the critical paths of a 
functional 

/ Ldt , 

where L is a function defined on the tangent bundle TQ of a given differentiable 
manifold Q, the configuration space. We will not consider arbitrary variations, 
but only variations satisfying certain conditions. These conditions arise from some 
given constraints on the dynamics of the system. We will analize the case when the 
constraints are defined by a certain submanifold C of the tangent bundle TQ. Such 
constraints are usually called nonholonomic. 

There are two different approaches when dealing with constraints. The first one 
is based on the idea of understanding the constraints as constraint forces. This 
point of view, that seems very natural in a physical context, gives rise to the classi- 
cal d'Alembert-Lagrange principle. Mechanics of Lagrangian systems with nonholo- 
nomic constraints based on this principle is called nonholonomic mechanics 113 



But there is another different point of view, that seems more natural when one is 
interested in optimizing a given functional defined as above when there are con- 
straints. For example, if we wish to change the state of a given system minimizing 
a cost functional (a typical problem in engineering or economics), it is not natural 
to understand the constraints as forces acting on the system. In this case one is in- 
terested in minimizing the functional considering only the variations allowed by the 
constraints. Mechanics of Lagrangian systems with nonholonomic constraints based 
on this idea is often called vakonomic mechanics (mechanics of variational axiomatic 
kind 0). For example, optimal control theory is a typical example of vakonomic 
mechanics. It is interesting to notice that both mechanics do not coincide in gen- 
eral, but they agree when the constraints are integrable. Several references on these 
topics are [0, 0], § and [0. See also [g 



The paper is organized as follows. In section 2 we give a generalized notion of 
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a variational problem. Section 3 is devoted to study vakonomic mechanics from a 
geometric point of view, with special attention to the existence of admissible vari- 
ations in order to obtain the equation of motion. In section 4 we do the same 
for nonholonomic mechanics. Both mechanics are understood as generalized varia- 
tional problems. In section 5 it is proved that, when the constraints are integrable, 
vakonomic and nonholonomic mechanics coincide. In section 6, as an example of 
vakonomic mechanics, a geometric formulation of optimal control theory is studied. 
Finally, in section 7, we give a geometric formulation of vakonomic mechanics when 
the constraints are defined by a distribution (linear constraints in the velocities). 

Basic knowledge of differential geometric structures is assumed. The presenta- 
tion is almost self-contained but the interested reader may consult the bibliography 
for more specific topics as the vertical lift, the fibre derivative and the canonical 
involution or the Euler-Lagrange operator [^. 

2 Variational problems 

2.1 Elements of a variational problem 

First, we are going to define what we mean by a variational problem. A variational 
problem consists of the data {Q, L, C, C, W) where: 

• Q is a n-dimensional different iable manifold, the configuration space. 

• L is the lagrangian function defined on the tangent bundle, L: TQ — > M. 

• C is the constraint submanifold, and it is a submanifold of TQ. 

• C is the family of admissible paths. Given two points qi,q2 G Q and a compact 
interval / = [ti, ^2], we will say that a path 7: / — Q of class is admissible 
if: 

7(^1) = Qi, lih) = q2 and 
7(t) e C, for all t G /. 

• W are the admissible variation fields (or infinitesimal variations). For a given 
admissible path 7, consists on a certain set of vector fields along 7. 

Notice that we do not consider variations of a path 7, but variation fields along 7. 
Now we are ready to define the variational problem associated to (Q,L,C, C, W). 
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The action of L along a path 7 is the functional S: C — > M given by the integral 

Sb]= [Lm)dt. (2.1) 



A variational problem consists in finding the critical admissible paths of the 
functional S, in a sense that will be precised later. 

2.2 Variations and variation fields 

Let 7: / — > Q, 7(ti) = Qi, 7(^2) = ?2, be an admissible path. A variation of 7 is a 
function F: (-5, 5) x I — > Q such that: 

1. = r{e,-) is a one-parameter family of paths defined on / with fixed end- 
points, T{e,ti) = qi, We G (—6,6), i = 1,2, and 

2. r(0,t) = j{t), e / (if there is no variation, £ = 0, we obtain the original 
path 7). 

Given a function F(£, t) of two real variables, we will denote its derivatives with 
respect to e and t as F' and F, respectively. It is clear that F' and F are vector fields 
along F. 

Now, we are in conditions to define a variation field of F. 

Definition 1 Given a variation T{e,t) of an admissible path j, the variation field 
of F is the vector field w along 7 defined by 



w(^)=T(o,)F- A 



= F'(0,i). 

(o,t) 



Notice that 

w(ii) = 0, w{t2) = , 

since the F^ have fixed end-points. 

Therefore, given a family of variations of 7, we can associate to them a family of 
variation vector fields along 7. We will say that a variation F of 7 is admissible if its 
associated variation vector field along 7, w{t) — F'(0,i), is an admissible variation 
field of 7, i.e., w e Wj. 

We finish this description about variations and variation fields with a useful 
lemma, whose proof is straighforward in local coordinates. 

Lemma 1 For any w vector field along 7 and any function A: / — >• M, 

s o (Aw) ■ = (DA) vl(7, w) -I- A s o w . 

□ 
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Notice that, if w is a vector field along 7, then w is a vector field along w and 
so w is a vector field along 7. The function A: / — > M denotes a function of time, 
and it is clear that Aw is another vector field along 7. D is the usual derivative with 
respect to the time. The map vl denotes the vertical lift vl: TQ Xg TQ T(T(5). 
Its local expression is vl(g,f,-u) = (g, f;0,-u). Finally, s: T(T(5) T(TQ) denotes 
the canonical involution, which is an isomorphism between the two vector bundle 
structures of T{TQ). Its local expression is s(g, v] u, a) = {q, u; v, a). 

2.3 Critical admissible paths 

Definition 2 An admissible path 7 is said to be critical if, for each admissible 
variation F^, the first variation o/S'fFe] is zero; i.e., 



= 

£=0 



for each admissible variation of ^. 



The main purpose of this paper consists in discussing the criticity conditions for 
different variational problems and describing the solutions. First of all, we are going 
to describe the criticity condition for a general problem. 

It is clear that, if (7: Q — >^ K is a function, then, for any function F(£, t) (F: U C 
IR^ — > Q) of two real variables, 

^^?(F(£,t)) = (d^(F(£,i)),F'(£,t)), 

and similarly for d/dt. 

Now, suppose that F(£, t) is a variation of a path 7. Let us consider F: [—5, 5) x 
/ — > TQ. Derivation of F with respect to e and t yields (F)' and (F)', which are 
now vector fields along F. Taking £ = yields two vector fields along 7, which are 
sow and 7. 

Then, if /: TQ ^ R is a function, we have 



d_ 



/(F(£,i)) = (d/(7(^)),s(w(^))). (2.2) 

£=0 



(Remember that w is a vector field along w, and so w is a vector field along 7, so 
the contraction makes sense.) 

Now, we can characterize the criticity condition in a more manageable way. 
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Proposition 1 Given a variational problem {Q,L,C,C,yV), an admissible path 7 
is critical if and only if 



I 



(dL(7(t)),s(W(t))) dt = 0, 
for each admissible vector field w G W^. 
Proof: Using in ([21] ), we obtain 

^S[T,] = J (dL(7(t)),s(w(t)))dt, 

e=0 

and the result follows. □ 

Observe that this condition does not depend on the full variation T{e, t), but only 
on its variation field (see also ||^). Therefore, in our study of variational calculus, 
we will shift our attention to infinitesimal variations rather to finite variations. 

2.4 The Euler-Lagrange operator 

To obtain a more manageable condition of criticity, it is convenient to perform an 
integration by parts. First, let us define the Euler-Lagrange operator of L. 

Definition 3 The Euler-Lagrange operator associated with a function L: TQ — > M 
is a mapping Sl'- T^Q T*Q defined by the relation 

{Sl o 7, w) = {dL o 7, s o w) — D{VL o 7, w) , 

for any path I —y Q and vector field w along 7. 

Here, the map VL: TQ T*Q is the fibre derivative of L. Recall that, given 
a vector bundle E B, if f: E ^ is a function, then the derivatives of the 
restrictions of / to the fibres define the fibre derivative of /, which is a map Vf: E 
E*. Its local expression is Vfip; a) = (6; df /da). 

It is easy to check (in coordinates) that the Euler-Lagrange operator is well- 
defined by this relation. 

Therefore, the Euler-Lagrange operator is a one-form along the projection T^Q 
Q, and also an affine bundle map along TQ —>■ Q. The expression in local coordi- 
nates of £l is the usual one, 

fdL d fdL\\ , 

^' = [-d^-dt[d^r'^ 
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where d/dt is the total time- derivative operator. The Euler-Lagrange operator can 
be extended in the same way to a time dependent Lagrangian. 

Using the definition of the Euler-Lagrange operator, we can characterize the 
criticity condition in the usual form. 

Theorem 1 Given a variational problem {Q,L,C,C,yV), an admissible path 7 is 
critical if and only if 



for each admissible variation field w G W-y. 

Proof: From proposition Q and the definition |^ of the Euler-Lagrange operator 
we obtain that 



The result follows observing that, since w is a variation field, the last term vanishes 



For the case when there are no constraints, C = TQ and W-y is the set of all the 
vector fields along 7, we obtain the well-known Euler-Lagrange equation. 

Corollary 1 Given a unconstrained variational problem, a path 'y & C is critical if 
and only if 



In this paper we will be interested in variational problems when C C TQ, C 7^ 
TQ. Hence, given a set of admissible paths, it is necessary to select a set of admissible 
variation fields (or infinitesimal variations) along the admissible paths. We will 
consider two different approaches to this problem. The first one is vakononomic 
mechanics, which can be considered as a strictly variational approach. The second 
one is nonholonomic mechanics, which is variational in our generalized sense, but 
not in the classical one. Nonholonomic mechanics is the usual way to describe the 
dynamics of a mechanical system with constraints. In the next two sections we will 
describe the dynamical equations obtained in each case. It is interesting to remark 
that both approaches are equivalent when the constraints are integrable (holonomic 
constraints) . 

We finish this section with a useful property of the Euler-Lagrange operator that 
will be used in many calculations in the following. The proof is straighforward in 
local coordinates. 






□ 



£l o 7 = . 
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Lemma 2 For any /: TQ M and /i: / ^ M (a function of time), 

where t\ \ T^Q TQ is the canonical projection. □ 

3 Vakonomic mechanics 

Roughly speaking, vakonomic mechanics is the result of variational calculus when the 
variations are restricted by some constraints on the positions and also the velocities. 

Our initial setting is therefore a submanifold C C TQ of codimension m < n; let 
us denote by j the inclusion of C in TQ. A constraint is any function (p vanishing 
on C. Locally C is defined by the vanishing of some constraints 0*: TQ M 
= 1, . . . , m) whose differentials d0* are linearly independent at each point of C . 

We will assume that the projection of C to Q, tq o j: C — ^ Q, is a submersion. 
It can be easily proved that this statement is equivalent to say that the constraints 
0* can be chosen such that their fibre derivatives 1^0* are linearly independent at 
every point of C. In local coordinates, this means that dc^)^ jdv^ has maximal rank. 
That is, the constraints restrict the velocities, not the positions. 

With the assumptions above, the image (tq o j)(C) C Q is open, so we may 
assume that the projection C ^ Q is a surjective submersion. Then there exists the 
vertical subbundle V(C) C T(C), which has rank n — m (the dimension of the fibres 
of the submersion). Indeed, at each Vg E C we have V^g(C) = T„^(C) fl V^^iTQ). 

To obtain the equations of motion of vakonomic mechanics, we need first to 
describe which are the admissible variations. 

3.1 The variations of vakonomic mechanics 

We remember that an admissible path is a mapping j: I —>■ Q such that 7 takes 
its values in the submanifold C C TQ. Due to our assumptions on C, there exist 
vector fields locally defined on Q taking values in C. Their integral curves have their 
derivatives in 

C, so there are many admissible paths. 

Definition 4 Let T be a variation of an admissible path 7. The variation T is called 
a strongly admissible variation of 7 if every path is admissible. 

If F is a strongly admissible variation, then (j){r{e,t)) = 0, for any constraint 0. 
Taking the derivative with respect to e at e = and using ( p.2| ) we have 

(d0 o 7, so w) = 
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for every constraint 0. This can also be expressed as 

s(w(t))GT^(,)(C) 

for each t & I. 

Definition 5 A variation field w of an admissible path 7 is called an admissible 
variation field for a given vakonomic problem if 

s(w(t))GT,(,)(C), 

that is, (d0 o 7, so w) =0. 

From the definition of the Euler-Lagrage operator (definition we obtain that 
w is a variation field if and only if 

{S^ o 7, w) = -D{V(j) o 7, w) 

for every constraint 0. 

It is important to remark that an admissible path may not have any nontrivial 
strongly admissible variation, and so an admissible variation field may not arise from 
a strongly admissible variation. One may say that the variations having admissible 
variation fields are the variations that preserve the constraints up to first order in e. 
These variations may be called weakly admissible variations. 

Next, we are going to give a more detailed description of admissible variation 
fields. Among all the vector fields w along 7, we consider a particular submodule. 
Take the subbundle 

/.^c7*T(g) = /x,T(g) 

whose sections are the vector fields w along 7 of class whose vertical hfts vl(7, w) 
are tangent to C. Using this subbundle we can express the admissible variation fields 
in a more manageable way. First, notice that, since I is an interval, both and 
7*T((5) are trivializable. Therefore there exists a global frame for 7*T((5), (w^) 
{k = 1, . . . ,n). Since is a subbundle of rank n — m, we can assume that the last 
n — m of the span this subbundle. 

Any vector field along 7 can be thus uniquely written w = I]fc=i -^^w^, where A'^ 
are functions of time. Then w is a variation field if and only if the coefficients A'^ 
vanish at the end-points of /. Moreover, according to definition H, w is an admissible 
variation field if it is a variation field and (d0*o7, sow) = 0, for i = 1, . . . , m. Taking 
into account lemma |I], this condition can be written 

n n 

{V<P' o 7, Wfc)DA'= + o 7, s o Wfe) A^ = . (3.1) 

k=l k=l 



X. Gracia et al Variational calculus in constrained systems 



10 



Notice that, since the fibre derivatives Vc/)'^ are hnearly independent at each point, 
the matrix with entries 07, w^) has maximal rank, m. By the special choice of 
the Wfc, the last n — m of them vanish under the and hence the square matrix 
A = {{V(ff o 7, Wj))j is invertible. So, writing the equation as 

m m n 

Y.A)m^ + Y.B)\^+ c\\' = Q, (3.2) 

j=l j=l l=m+l 

we can isolate the DA-' (j = 1, . . . , m) linearly in terms of all the A^ {k = 1, . . . ,n). 
This determines uniquely A-', j = 1, . . . , m as functions of A', Z = m + 1, . . . , n, due 
to the initial condition A-'(ti) = 0. However, not any A', I = m + 1, . . . ,n vanishing 
on ti, t2 are admissible. Notice that the solutions A-', j = 1, . . . ,m must vanish also 



in ^2- In fact, the existence of solutions of (|3.2| ) satisfying A (ti) = A (^2) = 0, 



A; = 1, ... n, is not guaranteed in principle. If we write ( p.2| ) as 

A A(i) = —B A(i) — C A(2) , 
the solution satisfying A(i)(ti) = is 

A(i)(t) = -u{t) fHs)]-' A-\s) C{s) A(2)(s) ds , 
Jti 

where h>{t) is the fundamental matrix of the homogeneous system ^ A(i) = —B A(i) 
satisfying the initial condition i'l{t2) = If A(i)(t2) = 0, then necessarily 

f^Hs)]-^ A-\s) C{s) A(2)(s) ds = . (3.3) 

If the system is homogeneous (C = 0) we obtain the trivial solution \^{t) = 0, 
j = 1, . . . , m, remaining A^, Z = m + 1, . . . , n as arbitrary functions satisfying the 
boundary conditions A'(ti) = A'(t2) = 0. As for the general case, in the following 
section we will show that admissible variations always exist in vakonomic mechanics. 

Condition (|3.1|) is very useful to study variation fields in vakonomic mechanics, 
as we show in the following example. 

Example Let Q = be the configuration space, with coordinates (x, y) , and con- 
sider a Lagrangian function and a constraint both depending only on the velocities, 
i.e., L = L{x,y) and = (j){x,y). 

From our assumptions on the constraints we can write locally (j) = y — f{x) = 0. 
In this case, using theorem ^ it is a simple calculus to show that the equations of 
motion of the vakonomic problem are x = 0, that is x{t) = at+b and y{t) = f{a)t+c. 
The parameters a, b and c are obtained from the boundary conditions x{ti), y{ti), 
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x{t2) and y{t2). If •yit) = {x(t),y(t)) is a straight line satisfying the boundary 
conditions, using ( p. ID , the reader can check that there exist admissible variation 
fields, and they are vector fields along 7(t) of the form A(t)w, where A(ti) = A(t2) = 
and w{x,y) = {x,y; l,/'(a)). 

For example, if (f){x, y) = y — vT+i^ = and x{t) is a linear function of time, 
there are not strongly admissible variations (see 0). But there exist admissible 
variation fields, so there are weakly admissible variations. In fact, the weakly admis- 
sible variations T{e, t) = {x{e, t), y{e, t)) have the form x{e, t) = x(t) + X(t)e + o(e), 
y(e, t) = y{t) + (a/Vl + a2)A(t)e + o(e), where A(ti) = Xiti) = 0. 

3.2 The equations of motion of vakonomic mechanics 

As we have shown, a critical path of the action with constraints is an admissible path 
7 such that J {£i{'j(t)),w(t)) dt vanishes for each admissible variation field w (theo- 
rem |l]). To obtain the corresponding Euler-Lagrange equation, we first establish the 
following proposition. 

Proposition 2 Given a variational problem {Q,L,C,C,W), where W are the va- 
riation fields satisfying definition let 7 be an admissible path. Then, for any 
family of functions Hi{t), i = 1, . . . ,m, the first-order variations of the J^Ldt and 
J^{L + YlT^i f^i4>^) dt with respect to an admissible variation field w G W coincide. 

Proof: In principle, since the variations may not be strongly admissible, it is 
not clear that the variations of both actions yield the same result. However, using 
theorem ^ and definition ^ the difference of the first-order variations of the actions 
is 

/ fe'" M.0^°')''^)d^ = £ / /i^(d0^o7,sow)dt-£ /D(/i,(P0^o7),w)dt , 

and both terms vanish whenever w is an admissible variation field. Therefore, the 
variations of the two actions coincide when w is an admissible variation field. □ 

Theorem 2 Given a variational problem {Q,L,C,C,W), where W are the varia- 
tion fields satisfying definition let 7 be an admissible path. Then 7 is critical if 
and only if there exist functions ^j: I ^W, j = 1, . . .m, such that 

This is the equation of motion of vakonomic mechanics. 
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Proof: If equation ( p.4|) holds then, for each admissible variation field w, Jj{Si^_^j2"" fit<j>^° 
7, w) dt = 0, which, according to Proposition (^, is equivalent to Jj{£l°1, w) dt = 0. 
This shows that 7 is a critical path. So it remains to prove the converse: that equa- 
tion (|3.4| ) is a necessary condition for the criticity of an admissible path 7. First, 
notice that the /Xj can be chosen such that 

°7.Wj) = (3.5) 
for j = 1, . . . , m. Indeed, by lemma |^ this equation can be written as 

m m 

{8l o 7, Wj) + E (^0' ° 7, Wj) - E {Vcj)' o 7, Wj) = 

i=l i=l 

for each j = 1, . . . ,m. From definition ^ and the choice of Wj we have 

{£^^ o7,Wj) = (d0*o7, so Wj) - D(X>0* o7,Wj) = (d0' o 7, so w^-) , 
for j = 1, . . . ,m. That is, we have 

(D/ii) A) - /i, B) -Dj = 0, (3.6) 



where A and B are the matrices we have used previously ( |3.2|) . So again we have a 
linear differential equation that determines the functions /Xj (up to initial conditions). 
From now on we assume that A is the identity matrix; this can be easily done through 
a linear change of the basis (wj). 



If we apply the variational principle for the modified Lagrangian L + 
we have 

n ^ 



i+EIliA'^'/'' °7' Wfc)A dt = 



for each set of functions A'^ yielding an admissible variation field. 

If we choose the functions /ij satisfying (|3.5|) , then the sum is only from m + 1 
to n: 



=m+l 



This must hold for any choice of the functions A™^^, . . . , A" giving an admissible 
variation field. However, as we have shown in the preceding section, the functions 
A™"^^, . . . , A" are not arbitrary in general, due to the final conditions A^(t2) = ■ ■ ■ = 
A™(t2) = 0. Let {fii)i<i<m be the particular solution of (|3.6|) satisfying fii(t2) = 0, 
and let (z/-^) be the transpose of the fundamental matrix of the homogeneous system 
of ( |3.6| ) satisfying the initial condition iyl{t2) = Notice that u = {i^)^^- (In 
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general, if z/ is a fundamental matrix of x = A ■ x, then is a fundamental 

matrix of x = —A* ■ x). Then the general solution of ( W^ is fii = fii + Y^]LiPji^l, 
where pj are arbitrary constants. Suppose for a while that there exist admissible 
variation fields w = I]fc=i ^^^k with 

for / = m + 1 . . . , 72. Then ( p.7| ) is a vanishing sum of integrals of squares, which, 
combined with ( |3.5| ), yields the equation of motion ( |3.4| ). It remains to prove that 

the choice of such A', I = m + 1, . . . ,n, gives an admissible variation field. From 
(3.3) and u = the variations defined by (3^) are admissible if and only if 

J^^ C(s) [{^l+j:ZM+J:T=iPj^D^' ° ^(2))] ds = , (3.9) 

where W(2) denotes the last n — m vector fields. Now, from lemma ^ and definition 
H, we have 

mm m 

(^E"L i^'</'^°^'^') = w/)-Xl(Dz/*)(P0^o7, wi) = ^i7j(d^o7,sowi rangle 

' ' ' j=i j=i j=i 

(3.10) 

Using that C\ = (d0*o7, so w^) and combining ( p.9| ) and ( p.lO| ), we obtain the linear 
system for the pj 



m ^2 " 

/i=l l=m+l ^ ^ 

rt2 



= -/ (^2":^p'.</.^' °7,w/) (f^+^-^^^^fc o7,w/);. (3.11) 

l=m+l ^ ^ 

If this system has any solution, then we can find values for pj, j = 1, . . . , m, such that 
the functions A' defined in ( p.8| ) give rise to admissible variations. Now, we prove 
that this system has always solution. Consider the pre-Hilbert space of continuous 
vector- valued functions f : [^1,^2] M"^™ with the usual scalar product (f , g) = 
J2?=m+i Iti fi'Qi- Let V be the finite-dimensional subspace spanned by the m vectors 



as 



z = 1, . . . , m. Then we can write the system (|3.11[ 

m 
h=l 

where v = -((^^+^- ^^^^^ o 7, w^+i), . . . , ^^^^^ o 7, w„)). 
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The solutions of this system are any constants ph such that Ylh^=i Ph ■ is the 
orthogonal projection of v onto V . This is well defined, since V is finite-dimensional. 
(Notice that the ph may not be unique, since the are not necessarily independent). 
This completes the proof. □ 

Remark: Notice that, using lemma ^ the equation of motion may also be 
written as 

m 

° 7 = E ((D/i.) ^^0' ° 7 - /ii ^<A» ° 7) • (3.12) 
1=1 

Remark: In the proofs of the equation of motion of vakonomic mechanics that one 

can find in the literature, it is usually assumed that the functions A', / = m+1, . . . , n, 
giving the admissible variations are free. Then, the equation of motion is obtained 
as a straight consequence of (^3)- However, in general, these functions are not 
absolutely free. 

4 The variations and equations of motion of non- 
holonomic mechanics 

In this section we are going to show that nonholonomic mechanics may be under- 
stood clS cl variational problem. 

Our initial setting is also the submanifold C C TQ, which may be locally defined 
by the vanishing of the constraints (jf. An admissible path is still a path j: I Q 
such that 7 takes its values in C. Let us define which are the admissible variation 
fields in nonholonomic mechanics. 

Definition 6 A variation field w of an admissible path 7 is called an admissible 
variation field (in nonholonomic mechanics) if it is a section of the subbundle C 
7*TQ. That is, 

vl(7(t),w(t))GT,(,)(C). 
Using the constraints, equivalent statements are 

(d0o7,vl(7,w)) = O, (4.1) 

or 

(P0o7,w) = O, (4.2) 

for each constraint (j). 

Notice the key difference with respect to vakonomic mechanics: now the admis- 
sibility is a C^(/)-linear condition on w. This linearity makes things easier. Next, 
we obtain the equation of motion of nonholonomic mechanics. 
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Theorem 3 Given a variational problem {Q,L,C,C,yV), where W are the admis- 
sible variation fields satisfying definition an admissible path 7 is critical if and 
only if there exist functions fij , j = 1, . . .m, such that 

m 

£LO^ = Y.^i^V(|)'o^. (4.3) 

i=l 

Proof: A critical path for nonholonomic mechanics is an admissible path 7 such 
that the first-order variation of the action, / (^.(7(t)), w(t)) dt, vanishes for each 
admissible variation field w. By equation ( [4. 21) , this means that Siijit)) is a linear 
combination of the P0* o 7. Thus, the result follows. □ 

Remark: It is obvious that there always exist admissible variation fields in 
nonholonomic mechanics. For example, if we calculate the admissible variation 
fields of the example in section 3, we will find that, in this case, they coincide with 
the admissible variation fields of vakonomic mechanics. But this will not be true in 
general if the constraints are not integrable. 



5 The case of integrable constraints 

Let us consider the problem of holonomic constraints in the usual sense. 

Definition 7 Given a differentiable manifold Q and a Lagrangian function L: TQ — 
M, a holonomic problem is a variational problem where 

• The constraint submanifold is given by a submanifold P G Q, thus C = TP. 

• Admissible paths are paths 7: / — > P G Q. 

• Variation fields w along 7 are admissible if they are tangent to P. 

Notice that, from any admissible variation field along an admissible path 7, one 
may construct a variation F contained in P. Therefore, it is clear that the problem 
with holonomic constraints is equivalent to the unconstrained variational problem 
defined on P by taking the restriction of the Lagrangian L to TP G TQ. 

Now, consider the cases of both vakonomic and nonholonomic mechanics when 
the constraints are defined by an integrable subbundle C G TQ. In this situation, 
we have the following equivalence. 

Theorem 4 If the constraint submanifold C is an integrable subbundle ofTQ, then 
both vakonomic and nonholonomic mechanics coincide, and they are equivalent to a 
holonomic constrained problem on each integral submanifold of C. 
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Proof: First of all, notice that, in both path 7: / — > Q is admissible 

(i.e., 7 is in C) if and only if it is contained in an integral submanifold P G Q of C. 
Recall that Tq{P) = Cq, for each point in P. 

Let w be a variation field of 7. We know that w is admissible in the nonholo- 
nomic framework if vl(7(t), w(t)) G T^(i)(C) (see definition H). Since the vertical 
lift restricts naturally to subbundles, this is equivalent to say that vl(7(t), w(t)) G 
y-y{t)C C V^(t)(T(5), that is, w(t) G C^(t) or, what is the same, w(t) G T^(()P. And 
this last condition says that w is admissible for the holonomic problem. There- 
fore, we have proved that a nonholonomic problem with integrable constraints is 
equivalent to a holonomic problem on each integral submanifold of C. 

Now, we show the equivalence with the vakonomic problem. Since C is an 
integrable subbundle of TQ, it is known that the integral submanifolds of C can be 
locally described as ^p = constant, for some independent functions ip on Q. This 
implies that the constraint submanifold C can be locally described by = d^^ = 0. 
Here, dip: TQ — > R is the differential dip of ip: Q — > M considered as a function on 
the tangent bundle. 

Let w be a variation field of 7, in the sense of vakonomic mechanics. Then, for 

any = dip, we have 

= {d(j) o 7, s o w) = ddip o s o w . 

Using the property ddip = ddip o s, we obtain 

= ddip o w = {ddip o w, w) = (d0 o w, w) = D(</) o w) = D{dip o 7, w) . 

Thus, w is an admissible variation field in vakonomic mechanics if and only if {dip o 
7,w) is constant. Since w(ti) = 0, this constant is zero, {dip o 7, w) = 0, which 
means that w is tangent to P. Therefore, w is admissible for the vakonomic problem 
if and only if it is admissible for the holonomic problem on P. □ 

6 Optimal control and vakonomic mechanics 

A problem of optimal control may be given by the following data: a configuration 
space B describing the state variables, a fibre bundle tt: M — > B whose fibres 
describe the control variables, a vector field Y along the projection of the bundle, 
Y:M — > TB, and a "Lagrangian function" L: M — > R. For a path 7: J M 
where vr o 7 (not 7!) has fixed end-points, the problem is to find an extremum of 
the action 

/ mt))dt 
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when 7 satisfies the differential equation 

(tt o 7)' = y o 7 (6.1) 

that rules the evolution of the state variables. 

It is easy to show that this is indeed a vakonomic problem on the manifold M, in 
which the Lagrangian L is very singular, since it does not depend on the velocities. 
The constraint submanifold C C TM, given by the differential equation above, is 

C = {wu G TM I TttK) = Y{u)} . (6.2) 

In this way, a path 7 is admissible if and only if it is a solution of the differential 
equation ( |6.1| ) or, equivalently, if it takes values in the affine subbundle C of TM. 
On the other hand, from the special characteristics of optimal control problems, we 
can relax the boundary conditions as we have done. Observe that theorems [l| and 
^ remain true since L does not depend on the velocities and the structure of the 
constraints ( |6.1D (they do not depend on the derivatives on the control variables). In 
coordinates, if x*, i = 1, . . . m, are local coordinates in B and (x*, u"), i = 1, . . . m, 
a = 1, . . . ,n — m, are local coordinates in M, the action is given by 

L(x\u")dt 



and the constraints are given by a set of first order differential equations 

= F*(x, u) , i = 1, . . . , m. 



with boundary conditions x\ = x*(ti) to {t2) (there are no boundary con- 

ditions on the control variables). Notice that for this vakonomic problem the con- 
straints are very particular: they express the velocities of the state variables in terms 
of the state and control variables. 

Let us identify which variation fields w are admissible. Notice that frames for 
the bundles C 7*T(M) are provided with (wa) and (wj, w^), where for instance 

d d 

if we have coordinates (x*, u") on M. Writing w = A*Wj -|- A"Wq,, the differential 
equation ( |3.1j ) turns out to be 

DA^ - I^A^- = I^A^ 
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Optimal control theory admits several geometric formulations and expressions 
of the equation of motion. First, we can give a lagrangian description on the con- 
figuration manifold T*B Xb M. We only need to define a lagrangian on its tangent 
space: 

C{x, p, u, X, p, u) = L{x, u) + {p,x — Y{x, u)) 

where we write (x; p, u) for the variables of T*B x b M — recall that T{T*B x b M) = 
T(T*B) Xtb TM; its elements are pairs of tangent vectors {x,p, u) projecting to the 
same tangent vector x. For a path rj on T*i? Xb M the Euler-Lagrange equation 
Sc o Tj is readily seen to be equivalent to the vakonomic equation (|3.4| ). 

Let us recall that, given a /-dimensional differentiable manifold Q with local co- 
ordinates (q^), there is a canonical tensor field on TQ, the vertical endomorphism 
S which is a rank-Z (1,1) tensor field on TQ such that Ker 5* = ImS* and whose 
Nijenhuis tensor A^^ vanishes. In natural coordinates {q^,q^), the local expression 
of 5* is given by 5* = dq^ Also, we have the Liouville vector field A (the 
infinitesimal generator of the dilations along the fibres on TQ) , whose local expres- 
sion is A = q^d/dq^. A vector field X in TQ is called a second order differential 
equation (SODE) if S{X) = A. Now, if C: TQ — > R is a Lagrangian function, 
we can construct the Cartan 1-form associated with C, given by 9c = S* o dC, the 
Cartan 2- form uc = —d6c and the energy function Ec = A(£) — C Then the paths 
rj solution of the Euler-Lagrange equations are the integral curves of a second order 
differential equation X in TQ satisfying the dynamical equation ix^c = dEc- 

Taking Q = T*BxbM a.ndC = L(x, m) + (p, x—y(x, m)), we obtain a geometrical 
expression of the equation of motion of optimal control theory, 

ix^^c = dEc , 

where Ec = {p, Y{x, u)) — L{x, u) and uc = —dOc = — d(pidx*) = dx* A dpi. 

Associated to this lagrangian description we can consider the hamiltonian for- 
malism of C This means to consider the manifold T*(T*i? x b M) with its canoni- 
cal symplectic structure, the Legendre's transformation of C, VC: T(T*B x b M) 
T*(T*5 X sM), and to push forward through it the energy function to a hamiltonian 
function U: T*{T*B x^ M) ^ M. 

However, the most interesting geometric description of optimal control theory is 
a presymplectic description which can be constructed on the manifold T*B x b M. 
Here we consider the 2-form uj obtained by pull-back through T*B Xb M T*B of 
the canonical 2-form of the last manifold. In local coordinates, u = dq A dp. Taking 
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the hamiltonian function defined by 

H{x, M, p) = {p, Y{x, u)) - L{x, u) , 
if is a path on T*i? M, the presymplectic equation 

irfUJ = dH O 7] 

is equivalent to the equation of motion of vakonomic mechanics, in the sense that 
there is a natural bijection between both sets of solutions. To show this is enough 
to write the local expressions. 

Remark: In optimal control theory the hamiltonian function is usually written 
as H{x,u,p) = {p,Y{x,u)) — fioL{x,u), where /iq = 0, 1. When = we recover 
the so called abnormal solutions (see |TT|). However, in the vakonomic approach, 
there are not abnormal solutions. The key issue is that we work with admissible 
variation fields, not admissible variation curves. 

7 Constraints defined by a distribution 

In this section we present a geometric framework for constrained systems when the 
constraint submanifold C is a distribution (or vector subbundle) of the tangent mani- 
fold TQ. In local coordinates, this means that the constraints are linear functions on 
the velocities. The subbundle C C TQ can be described in terms of its annihilator, 
(jo ^ T*(5. If this is locally described in terms of 1-forms, a* = a^(g)dg" {i = 
1, . . . ,m, where m is the codimension of C and (g") are local coordinates of Q), 
then C is locally described in terms of the constraints 

(j)'{vq) = {a'{q),Vq) =0, i = l,...,m. 

Let us consider the vector bundle TQ(BC^, in which we will set up the dynamics. 

On the one hand, given the Lagrangian function L on the tangent bundle TQ, 
let 6l = S* o dL be the Lagrange 1-form on TQ. Its pull-back along the projection 
TTi: Tg © ^0 — > TQ yields the 1-form 

ei = tiIOl 

on TQ © C". Also, using the Liouville vector field A on TQ, the energy function 
associated with L in TQ © C° is 



E = 7r*(A(L) -L). 
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On the other hand, let 9q be the canonical 1-form defined on the cotangent 
bundle T*Q. If jo: — > T*Q denotes the canonical inclusion and tt2: TQ(BC^ — > 

is the projection onto the second factor, then we can take the pull-back of these 
mappings to construct a 1-form 62 on TQ © C° as 

02 = (jo o t^2)*0q- 
Using the 1-forms di and 62, we have a presymplectic form 

n = -d{9i + 92). 

By means of the energy function E, we obtain a presymplectic dynamics on the 
extended phase space TQ © which is equivalent to vakonomic mechanics: 

Theorem 5 Let L: TQ — > M 6e a Lagrangian, and C C TQ a vector subbundle. 
For a path ^ in the manifold TQ © C^, consider the differential equation 

i^n = dEo^. (7.1) 

This equation is equivalent to the equation of motion of vakonomic mechanics in the 
following sense: 

• If ^ = (^1,^2) i'S a solution of ( \7.1\ ) and C,i is the lift of a path in Q, ^1 = 7, 
then 7 is an admissible path (7 is in C) and is a solution of the equation of 
motion of vakonomic mechanics (pT 



Conversely, given an admissible path 7 which is a solution of ^3.4i ), together 
with the multipliers fi\ then the path ^(t) = (7(t), /^i(^)d0*(7(t))) is a solu- 
tion of equation ( 17. 



// the Lagrangian is regular then equation ( \7.1\) already implies that $,1 is the lift of 
a path in Q. 

Proof: It is enough to check the equivalence in local coordinates. We take 
(g", f"^, Aj), a = 1, . . . , n, i = 1, . . . , m, as local coordinates in TQ®C^ (we represent 
an element of as Then we have 

6*1 = — dg" 
02 = A,<(g)dg«, 
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Since E = v"" {dL/dv^) — L, we also have 

Now let us consider the path S,{t) = (g"(t), f"(t), A*(t)), with velocity ^ = 
(g, V, A; q, v, A). Then 

Therefore, equation i^Q = dE is equivalent to the three equations 

q^^ = v^^ (73) 

r< = 0. (7.4) 

The fact that ^1 is the lift of a path 7 in Q means in coordinates that v{t) = q{t), 
so equation ( [7.3|) is an identity. Notice also that if the Lagrangian is regular then 
the Hessian matrix ( 

dv^dv''') invertible, therefore in this case equation ( [7.3| ) implies 
that v{t) = q{t), that is to say, ^1 is the lift of a path in Q. Then, in equation ( [7.4j ) 
we obtain the constraints (p\q,q) = a*(g)g'^ = 0, that is, 7 is an admissible path. 
Finally, we can write equation ( |7.2| ) as 

1 Q^aQ^b + 1 Q^aQqb + 1 Qgb + ' dq- ^ dq- " 



But these are the vakonomic equations (p.4|) of the extended Lagrangian C = L + 
/ija^f °, using the natural identification between the functions /Zj and the coordinates 
\i of the cotangent vectors. □ 

Remark: In a similar way, the vakonomic dynamics can be also defined on the 
manifold C © C°. Since C © C° is a vector subbundle of TQ © C°, we can pull-back 
the 2-form VL and the energy function E to define a 2-form VL and a new function E 
in C © C°. The reader can check that, then the equation of motion of vakonomic 
mechanics (|3.4|) is equivalent to find the paths ^ = (^1,^2) in C © C°, where ^1 is 
the lift of a path in Q, such that 

ifi = dEo^. (7.5) 
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Moreover, if the lagrangian is regular, then is a symplectic form. Notice also that 
this equation, as well as equation ( |7.1D , can also be expressed in terms of vector 
fields. For instance, when Q is symplectic, the solutions of equation ( [7.5|) are the 
integral curves of the vector field X such that 

ij^Q = dE. 



In the case of nonholonomic mechanics, a similar result can be proved, in the 
same way as for theorem Let us denote Qi = — d^i. Then we have: 

Theorem 6 Let L: TQ — > M 6e a Lagrangian, and C C TQ a vector subbundle. 
For a path C, in the manifold TQ © C^, consider the differential equation 

i^ni = dE 0^ + 620^. (7.6) 

This equation is equivalent to the equation of motion of nonholonomic mechanics in 
the following sense: 



If i = (^1) ^2) is a solution of ( \7.(\ ) where is the lift of a path 7 in Q, = 7, 
then J is a solution of the equation of motion of nonholonomic mechanics 



Conversely, given a path 7 which is a solution of ^■^) , together with the 
multipliers fi\ then the path C,(t) = (7(t), yUi(^)d0*(7(t))) is a solution of 
equation (\7.t\). 



If the Lagrangian is regular then equation ( [7. 6| j already implies that C,i is the lift of 
a path in Q. □ 



8 Conclusions 

In this paper we have presented variational calculus (in one dimension) in a geometric 
framework, aiming to study dynamical systems with non-holonomic constraints {i.e., 
constraints depending on the positions and the velocities). We have shown that a 
generalised formulation of variational calculus, in which the admissible paths and 
the admissible infinitesimal variations are not necessarily related, makes room for 
the study of dynamical systems subject to non-holonomic constraints from different 
points of view. This generalized variational calculus encompasses the often-called 
vakonomic mechanics (which is a strict variational problem with constraints) and 
the non-holonomic mechanics (based on d'Alembert's principle). 
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In the case of vakonomic mechanics, we have provided a geometric procedure to 
obtain the equation of motion, choosing an appropriate set of admissible infinitesimal 
variations proving that they always exist. 

In the case of non-holonomic mechanics, it is far more simple than in vakonomic 
mechanics to choose an appropriate set of admissible infinitesimal variations, and 
the corresponding equation of motion is readily obtained. 

Our formulation also provides a neat equivalence between both vakonomic and 
non-holonomic mechanics when the constraints are integrable (also called holo- 
nomic). 

We have also found the geometry lying on some particular cases of vakonomic 
mechanics, namely the case of optimal control and the case where the constraint 
submanifold is a vector subbundle of the tangent bundle. 

All the paper is written for the case of time-independent lagrangian and con- 
straints, but the reader may check that the time-dependent case may be dealt with 
by adjunction of the time variable in a not too involved way. 
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